Experimental Realization of the Quantum Box Problem 
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Abstract 

The three-box problem is a gedankenexperiment designed to elucidate some interesting features of quantum measurement and 
locality. A particle is prepared in a particular superposition of three boxes, and later found in a different (but nonorthogonal) 
superposition. It was predicted that appropriate "weak" measurements of particle position in the interval between preparation 
and post-selection would find the particle in two different places, each with certainty. We verify these predictions in an optical 
experiment and address the issues of locality and of negative probability. 



Weak measurements have been controversial ever since 
the concept was developed by Aharonov, Albert, and 
Vaidman (AAV) Q. In contrast to the usual, von 
Neumann, approach to measurement, weak measurement 
uses an apparatus whose pointer has a very large quan- 
tum mechanical uncertainty when compared with its typ- 
ical shift. After the system-pointer interaction, the shift 
in the pointer position is much smaller than its initial 
uncertainty and almost no information is gained about 
the quantum system. Nevertheless, after a sufficiently 
large number of measurements on an ensemble of iden- 
tically prepared quantum systems, the mean pointer po- 
sition can be determined to any degree of precision. In 
such a measurement strategy, one sacrifices knowledge of 
the value of an observable on any given experimental run 
to avoid entanglement with the measurement device and 
the ensuing 'collapse' of the wavefunction. In particu- 
lar, this makes it possible to contemplate the behavior 
of a system defined both by state preparation and by 
a later post-selection, without significant disturbance of 
the system in the intervening period. 

AAV calculated the shift in the pointer of a mea- 
surement apparatus that weakly measured an observable 
A between two strong measurements. The initial strong 
measurement pre-selects (or prepares) the state, , of 
the quantum system and the final strong measurement 
post-selects the quantum state, In between, con- 

sider a von Neumann-style interaction Hamiltonian of the 
form 



Hi = gAP x , 



(1) 



where A is the hermitian operator corresponding to an 
observable A of the quantum system, g is a (real) cou- 
pling constant, and P x is the momentum operator conju- 
gate to the pointer position X. In the absence of postse- 
lection, the effect of having this measurement interaction 
on for a time T (assumed short enough that A is constant 
during the measurement) shifts the pointer position by 

an amount Ax = K i^A^ = gT (^A\ ,such that one can 

infer a value for A by dividing the pointer shift by the in- 
teraction strength K. The main result of AAV's seminal 
work is that for sufficiently weak coupling strength K 
and in the presence of postselection, the inferred value of 



A is given by the following expression, which they called 
the "weak value" : 
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When the two strong measurements pre- and post-select 
the same state, Aw reduces to the usual quantum me- 
chanical expectation value for the operator A. How- 
ever when the pre- and post-selected states differ, weak 
values can take on surprising values that are not con- 
strained to lie within the eigenvalue spectrum of the op- 
erator or even to be real numbers. The surprising char- 
acter of weak values has led to skepticism about whether 
they should be considered proper measurements 
In spite of their controversial nature, a weak measure- 
ment has been experimentally performed by Ritchie et 
al. ; in addition, they have been useful in correctly de- 
scribing orpredicting surprising experimental outcomes 

Mill aim a 

The "Quantum Box Problem" was developed by 
Aharonov and Vaidman [l2L IT^ | ; it is a deceptively sim- 
ple thought experiment that elucidates some of the odd 
behavior which may result from studying post-selected 
systems. In the 3-box version, a particle is prepared (pre- 
selected) in an equally weighted superposition of being in 
one of three orthogonal quantum "boxes", A, B, and C, 
i.e., = 4j \A) + 4- \B) + 4- \C) and post-selected 

in the final state \ip f ) = 4- \A) + 4^ \B) - 4^ \C). The 
weak value for the particle being in box A can be calcu- 
lated by inserting the projection operator \A) (A\ in Eq. 

m i-e., A w = (V/MO Wi>/(^/IV>i) = +1. In this 
case, the pointer shift is the same as in the case when the 
particle was definitely in box A. We call this quantity the 
"weak probability for the particle in box A" and use the 
shorthand Paw, where A represents the rail. This weak 
probability is already strange since a strong measurement 

would find the particle in box A only |l/V3| = 1/3 of 
the time. Through a similar calculation, the weak prob- 
ability for the particle in box B, Pbw, is also found be 
be +1. Like normal probabilities, the sum of all weak 
probabilities is +1 (because the sum of all orthogonal 
projectors is the identity); therefore, if one performs a 
weak measurement of the particle in box C one finds the 



1 



weak probability, P cw = ("0/JC) (C\ip. t ) / {i>f\i>i) = -1- 
Such a result is very strange since it lies outside the range 
of eigenvalues for projection operators and normal prob- 
abilities - i.e., it does not lie between and 1. Neverthe- 
less, for a large enough ensemble of identically prepared 
and post-selected states, this result properly predicts the 
outcome of our experiment. Note that while apparently 
similar to the situation for quasi-probability distributions 
such as the Wigner function, this case is different, refer- 
ring instead to a directly observable measurement out- 
come. There are also other situations in which it has 
been suggested that negative probabilities might be use- 
ful for resolving locality "paradoxes" ^4|. For a review 
of such extensions to probability theory, see |Tfij . 

We use a linear optical interferometer to implement 
the quantum box problem (Fig. 1). This interferome- 
ter is similar to the Mach-Zehnder interferometer except 
that it has three optical paths (or rails) instead of two; 
we label these rails A, B, and C. A photon can be pre- 
pared in any superposition of these three rails by proper 
selection of the characteristics of beamsplitters BS1 and 
BS2. Similarly, any coherent superposition can be post- 
selected by controlling the characteristics of BS3, BS4, 
and the optical path lengths (relative phases). We use 
the transverse displacement of the photon as our mea- 
surement pointer. Measurements are carried out by tilt- 
ing one of the glass plates (GP A, B, or C); this results 
in a controllable transverse shift of the beam in one of 
the rails. For example, a glass plate in rail A displacing 
the beam along X can be thought of in terms of an ef- 
fective interaction Hamiltonian of the form g \A) (A\ Px- 
The measurement of the mean spatial shift of an ensem- 
ble of photons can be viewed as a measurement of the 
probability for each photon to have been in rail A. If the 
transverse shift is much less than the beam size (the un- 
certainty in the position of the individual photons), this 
is a weak measurement. Shifting only one rail at a time, 
we measure the size of that shift while keeping the other 
two rails blocked. This characterizes the strength Ka 
of the system-pointer interaction. We then combine all 
three beams with the correct relative phases for proper 
post-selection and again measure the shift Ax. The ra- 
tio of the shift in the post-selected state to that of the 
single rail constitutes our weak probability for that rail 
Aw = Ax/Ka- A negative weak probability is realized 
when the shift in the post-selected state is in the opposite 
direction to the shift on the individual rail. 

For the experiment, light from a 780nm, 30mW, diode 
laser is spatially filtered to yield a collimated TEMoo 
gaussian beam with a waist of 380/im; this amounts to a 
large ensemble of identically-prepared photons. To cre- 
ate the appropriate superposition, the transmissivity and 
reflectivity of each beam-splitter is adjusted by using two 
half- wave plates (A/2) prior to polarizing beam-splitters 
(PBS). The third half- wave plate is present to rotate the 
polarization in rail C from horizontal to vertical so that 
all three rails have the same polarization. Glass plates 
with thicknesses 10mm, 6.5mm, and 10mm are situated 



in paths A, B, and C respectively. The plate in rail A 
displaces the beam in the vertical direction (out of the 
plane of the inteferometer) while the plates in rails B and 
C displace the beam in the horizontal direction (in the 
plane of the interferometer) . In addition to these glass 
plates, 1-mm-thick microscope slides are located in rails 
A and C. These plates can also be finely tilted to set the 
relative phases, <f> A and <p c , between the light in these 
arms with that in rail B without significantly displacing 
the beam. Rails A and B are recombined coherently at 
beam-splitter (BS) 3. One of the outputs from BS3 is 
then recombined with the light from rail C at BS4. The 
phases are chosen such that the light in the right-going 
output of BS4 is our desired post-selected state. This 
beam is magnified by a factor of 3 on a screen behind 
which a CMOS camera (Logitech Quickcam) captures 
beam images. The second output of BS4 terminates 
at a photodiode that is used to set the relative phases of 
the three arms of the interferometer. 

The quantum box problem can be generalized for an ar- 
bitrary pre-selected state \(pA = a, \ A)+bi \B)+Ci \C) and 
post-selected state \(pA = af\A) + bf\B) — c/|C). If the 
coefficients are real numbers, we obtain the same weak 
values discussed earlier for rails A, B, and C if c^o,/ = 
bibf = CiCf, albeit with a lower overlap between our pre- 
and post-selected states. These requirements can be con- 
verted into experimental parameters in the following way. 
Given reflection and transmission amplitudes rj exp(i<f> ri ) 
and ti exp(i(j) ti ) for BSi, the 3-path interferometer pre- 
selects the state |<^) = r x \A) + t x r 2 \B) + ht 2 \C) - 
the phases are compensated for by optical path lengths. 
For the proper phase settings, the interferometer post- 
selects the state \<Pt) — ten \A) + r 3 r 4 \B) — t± \C) . To 
satisfy the condition a%af = bibf = CiCf, we require 
rit^r^ — ii^2^3^4 = txtit^. This is the condition that 
each path contributes the same intensity in the camera 
output port. In our experiment, the final beamsplitters 
are both 50/50, and therefore our post-selected state is 
approximately \<fif) = § 1-4) + ^ \B) — -j= \C) . The proper 
pre-selected state to obtain the desired weak values is 

\<Pi) = \jl \ A ) + v/| \ B ) + J\\ C )- We note that this 
reduces the overlap of the initial and final states from 
1/3 for the original states to y/l/lQ. 

The data for this experiment were taken in two parts. 
In both parts, we began by balancing the intensities in 
the camera arm from the three paths and aligned the 
beams in the interferometers such that they overlapped 
to better than 1/10 of their rms widths. All of the beams 
in the interferometer were vertically-polarized. Pairwise, 
beams A&B and B&C had interference fringe visibili- 
ties of about 95%. In the first part of the experiment 
we performed only single weak measurements on each 
of the 3 rails. A single glass plate (GP) in one of the 
arms was tilted to displace the beam in that path. The 
phases in the interferometers were set using the micro- 
scope slides so that paths A and B interfere construc- 
tively and paths B and C interfere destructively in the 
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camera output. Images of each individual beam (in the 
absence of interference) and the properly post-selected 
beam were recorded using the camera. In Fig. 2, hori- 
zontal profiles arc shown, with a particular displacement 
of rail C, for the beams in rails A (thin solid line), B (thin 
dashed line), and C (thick dashed line) alone, and also 
for the post-selected state (thick solid line). Beams A 
and B have the same average position to within less than 
1 pixel, or about 1/20 of the rms width. Beam C has 
been displaced from this centre position by —11.1 pixels 
or —0.69 rms widths. This is the coupling strength Kc- 
The post-selected state beam profile is displaced by +7.0 
pixels or +0.44 rms widths in the opposite direction of rail 
C; in other words, the observed value Ax/Kc = —0.64. 
This differs from the expected value Pew = — 1 because 
the displacement is already approaching the transition 
from the weak to the strong measurement regime. 

We have summarized our results for post-selected mea- 
surements of Pa, Pb, and Pc for a variety of coupling- 
strengths in Fig. 3. The observed displacements Aa; for 
measurements of Pa and Pb are shown as open circles 
and solid triangles respectively, as a function of Kia,b\- 
The theoretical prediction for these two rails is a straight 
line with a slope of 1 and a y-intercept of 0. The slope 
of 1 indicates a shift as large as if all the photons had 
traversed the shifted rail, that is, Paw — Pbw = 1- 
Note that this slope of 1 persists even into the strong- 
measurement regime, K{ A .b} > 1 rms width. This can 
be understood from the following argument. If we per- 
form a weak measurement on rail A, then in our post- 
sclcction output the amplitudes from rails B and C in- 
terfere perfectly destructively and we are left only with 
the field from rail A. Therefore, whatever displacement 
rail A has, so will the post-selected state ^3| . Of course a 
similar argument can be constructed for rail B. The data 
for rail C has different behaviour and no such argument 
applies. The experimental data is shown as solid cir- 
cles and the theoretical prediction as a solid line. Near 
zero displacement of the rail, the displacement of the 
post-selected state is in the opposite direction as the dis- 
placement of rail C. The weak probabilities are given 
by the slopes of the curves from the plot in Fig 3 near 
the origin. The weak probabilities are consistent with 
the predictions of +1, +1, and —1 for Paw, Pbw, and 
Pew respectively. For larger displacements in rail C, 
the weakness criterion is not satisfied, and the observed 
shifts eventually approach a strong-measurement limit of 

1/5 m. 

We have shown that the weak probability to find a par- 
ticle in rail A is +1, as is the the weak probability to find 
the particle in rail B. This begs the question, what is 
the weak probability to find the particle in both arms? 
It is well-known that a single particle cannot trigger two 
different spacelike separated detectors; in fact one might 
think of that as the defining quality of a particle - it 
must be either here or there but never both. Therefore, 
one would never expect to find a single photon in both 
rails A and B via projective measurements. It has been 



suggested, however, that this is a limitation of strong 
measurements; if a particle is prepared in a suitably de- 
localized state, might it be possible to find that particle 
in two different places at the same time if the presence 
of the particle is probed weakly |l7j? 

For a state involving only one particle, we can use the 
operator \A) (A\B) (B\ to find the joint weak probabil- 
ity for the particle to be in rails A and B simultane- 
ously. However, because \A) and \B) represent orthog- 
onal states, this operator is identically zero. Therefore 
the weak probability to find the particle in both rails is 
expected to be zero even when the single weak probabil- 
ities (i.e., Paw and Pbw) are both equal to 1. This 
seeming violation of standard rules of probability can be 
seen to be related to the relaxation of the non-negativity 
axiom of probability theory [8|, llq . 

In the second part of the experiment, we performed 
two different simultaneous measurements on the photons 
using two different degrees of freedom at the same time. 
In addition to the transverse beam displacement in rail B, 
we used the waveplate in rail C (see Fig. 1) just before 
BS4 to rotate the polarization of the light by a small 
amount. We prepared the same initial state as before 
but use a different final state, \1pf2) = ~\ I A) + \ \B) + 
-j= \C). Such a final state swaps the roles of rails A and C 
so that Paw — — 1 and Pew — +1- The reason for doing 
so was purely technical. Our final beamsplitters were 
50/50 for vertically-polarized light but for horizontally- 
polarized light they had greater than 90% transmission. 
We ensured that rail C, where the polarization rotation 
occurs, was transmitted rather than reflected, so that the 
observable polarization shift at the camera would not be 
negligibly small. We rotated the waveplate in rail C by 
a small amount that resulted in a Kc = 9.6° rotation in 
the polarization of the light after the final beamsplitter 
when rails A and B were blocked. 

We measured the displacement, Ax, of the post- 
selected state as a function of the rail-B coupling strength 
Kb- The results, Ax/Kb, are the inferred probabilities 
of the photon to be in rail B and are shown in Fig. 4., 
both in the case of no final polarizer (solid circles, solid 
line theory) and in the case of a final polarizer oriented to 
block vertically-polarized light (open circles, long-dashed 
line theory). Recall that all of the beams in the interfer- 
ometer initially had vertically-polarized light. When the 
polarizer was inserted to block vertically-polarized light 
we measured the beam displacement of only the light that 
had its polarization rotated, i.e., only photons which had 
traversed rail C. We also show the inferred probability 
of the photon to be in rail C - given by the polarization 
rotation of the final state divided by Kc = 9.6° (open 
triangles, dotted line theory). The low data point for 
the Pb was most likely due to a mode-hop-induced phase 
fluctuation, and the slight disagreement between the the- 
ory and experimental values for Pc is from imperfect in- 
terference. These data show, that for displacements of 
rail B in the weak regime (less than 0.5 rms widths), 
the entire distribution of the light is both displaced and 
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polarization rotated as if the photons really experience 
both weak measurements. However, by blocking all of 
the vertically-polarized light, we clearly see that the dis- 
tribution is not shifted at all. Therefore we can say that 
those particles that were definitely in rail C were not even 
weakly in rail B. 

We have implemented the quantum box problem in a 
3-rail interferometer and verified some of the important 
predictions about the weak measurements one can per- 
form in this system. Specifically, we have observed weak 
probabilities of +1 for rails A and B and a weak probabil- 
ity of —1 for rail C. We have also studied the transition 
out of the weak-measurement regime for rail C, when the 
interaction shifts the measurement pointer by more than 
about 0.5 rms widths. In addition, we performed two 
simultaneous measurements on two of the rails using po- 
larization and transverse displacement as pointers. For 
small displacements, we found that the entire beam was 
shifted and polarization rotated. However, when only 



the polarization-rotated light was studied, its transverse 
displacement was zero. This shows the (one-sided) anti- 
correlation between a strong measurement of the particle 
in one rail and a weak measurement of it in another. This 
leaves open the question of what one would observe by 
making joint weak measurements on two different rails. 
In another work, we have proposed an experimen tally 
feasible method for performing such measurements [19j, 
and an ongoing experiment aims to carry them out for 
the 3-box problem. These types of conceptually simple 
experiments yield insight into the sometimes shocking 
behavior of post-selected subensembles in quantum me- 
chanics, and into the power and the limitations of weak 
measurements. 
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Figure 1. The 3-rail Mach-Zehnder-style inter- 
ferometer. The TEMoo mode of a diode laser is 
filtered spatially using a pinhole (PH), two lenses 
and an iris. Light is placed in the proper coher- 
ent superposition of each of the rails labelled A, 
B, and C using half-wave plates (A/2) and polar- 
izing beam-splitters (PBSs). Glass plates (GPs) 
in each arm are used to displace each beam trans- 
verse to its direction of propagation, and micro- 
scope slides (MS) are used to finely adjust the 
phases in arms A and C. The three modes are 
recombined at two 50/50 beamsplitters (BS3 and 
BS4). The beam shines on a screen and a CMOS 
camera behind that screen captures its image. A 
photodiode (PD) in the second port of BS4 is 
used to set the relative phases light in the differ- 
ent rails. 

Figure 2. Sample experimental data when only 
rail C is displaced. We show experimental data 
for individual horizontal beam profiles from Rails 
A (thin solid line), B(thin dashed line), and C 
(thick dashed line) when a transverse, horizon- 
tal displacement of beam C is applied. The 
beam profile of the post-selected beam is also 
shown (thick solid line). The beam profile from 
rail C is displaced by —11.1 pixels, or —0.69 rms 
widths, which constitutes a measure of the cou- 
pling strength Kc- The corresponding shift Ax 
in the post-selected beam is +7.0 pixels, or +0.44 
rms widths, implying a probability of Pq = —0.64. 
(This is smaller than the expected Pew = — 1 be- 
cause in this image, the coupling strength is too 
large for a true weak measurement. This im- 
age is shown for clarity, while Figure 3 summa- 
rizes the data for weaker couplings as well, where 
the agreement with weak-measurement theory is 
good.) 

Figure 3. Experimental data for individual rail 
displacements. The displacement of the post- 



selected state is shown as a function of the dis- 
placement of each individual rail. The data for 
rails A, B, and C are shown as open circles, solid 
triangles, and solid circles respectively. Theory 
(with no adjustable parameters) is shown as a 
dashed line for rails A and B and as a solid line 
for rail C. The transition for the displacement of 
rail C from the weak measurement regime to the 
strong measurement regime is clearly visible at a 
beam displacement of about ±1 rms width. No 
such transition was observed or predicted for the 
other two rails. 

Figure 4. The weak probabilities for the pho- 
ton to be in rail B and rail C measured in the same 
experimental setup using two different pointers. 
The weak probability for the photon to be in rail 
C was measured using a small polarization rota- 
tion, and the probability for rail B was measured 
with a transverse beam displacement. Both are 
shown as functions of the displacement of rail B, 
with the polarization rotation being kept fixed. 
The resulting polarization rotations and displace- 
ments in the post-selected states were converted 
into weak probabilities by dividing by the rele- 
vant interaction strengths. The resulting weak 
probabilities for rails B and C are shown as solid 
circles and open triangles. For very weak mea- 
surements, the entire post-selected state is polar- 
ization rotated and displaced, by amounts which 
would imply that each photon was both in rail 
C and rail B with certainty. However, when a 
polarizer is inserted to block vertically-polarized 
light (such that we only detect the photons that 
have been polarization-rotated in rail C) we find 
the weak probability for the photon to have been 
in rail B to be zero (open circles, dashed line). 
Thus, those photons that were definitely in rail C 
were definitely not in rail B. 
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